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Abstract. Accessible groups whose word problems are accepted by a 
deterministic nested stack automaton with limited erasing are virtually 
free. 



1. Introduction. 

During the past several years combinatorial group theory has received an in- 
fusion of ideas both from topology and from the theory of formal languages. 
The resulting interplay between groups, the geometry of their Cayley dia- 
grams, and associated formal languages has led to several developments in- 
cluding the introduction of automatic groups || , hyperbolic groups |L2| , and 
geometric and language-theoretic characterizations of virtually free groups 

m 

We will restrict our attention to finitely generated groups. For any such 
group the language of all words which define the identity is called the word 
problem of the group. By words we mean words over the generators. Of 
course the word problem depends on the choice of generators. In [|17| virtu- 
ally free groups are shown to be exactly those groups whose word problem 
with respect to any set of generators is a context-free language. We are in- 
terested in investigating groups whose word problems lie in other language 
classes. 

Formal languages are often defined in terms of the type of machine which 
can tell whether or not a given word is in the language. Such a machine is 
said to accept the language. Context-free languages are accepted by push- 
down automata, and those context-free languages which are word problems 
are accepted by the subclass of deterministic limited erasing pushdown au- 



tomata [17, Lemma 3]. In this paper we show that the more powerful class 
of deterministic limited erasing nested stack automata accept exactly the 
same word problems. 
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Theorem 1.1. Suppose G is an accessible group whose word problem is 
recognized by a deterministic nested stack automaton with limited erasing 
which accepts by final state and empty stack; then G is virtually free. 

The question of whether or not every group whose word problem is ac- 
cepted by a nested stack automaton is virtually free has been open for some 



time. Some possible counterexamples are proposed in [15|. Notice our as- 
sumption that G is accessible. While it is not difficult to show that a group 
with context-free word problem is finitely presented (and therefore accessi- 
ble), deciding the same question for groups whose word problem is accepted 
by a nested stack automaton seems much harder. 

In |l7| and Jl8j virtually free groups are characterized by geometric con- 
ditions on their Cayley diagrams as well as by language-theoretic conditions 



on their word problems. In the course of proving Theorem 1.1 we are led to 
another such geometric condition. To express this condition we recall that 
a choice of generators determines a word metric on a group. The distance 
between two group elements g and h is the length of the shortest word rep- 
resenting g~ l h. This metric is just the restriction to vertices of the path 
metric in the corresponding Cayley diagram. Different word metrics for the 
same group are quasi-isometric. (See Definition ^^.) We will assume that 
every group is equipped with a word metric. 

Definition 1.2. A group is narrow if there exists an integer i such that for 
any ball B and all but finitely many other balls B' of the same radius, B is 
separated from B' by a set of size at most i. A group which is not narrow 
is wide. 

Two subsets are separated by a set S if every path between them intersects 
S. By path we mean a finite sequence of points each a distance one from its 
successor. In other words a path is just the sequence of vertices occurring 
in a path in the Cayley diagram. 

Whether or not a group is narrow seems to depend on its word metric, 
but in fact it does not. 

Theorem 1.3. Narrowness is a quasi-isometry invariant of groups. 

In adition narrowness characterizes those accessible groups which are virtu- 
ally free. 

Theorem 1.4. If the group G is finitely generated and accessible, then the 
following conditions are equivalent. 

1. G is wide; 

2. G is not virtually free; 

3. G contains a one-ended subgroup. 

If G is wide, then a one-ended subgroup can be found by splitting G over 
finite subgroups. 

Narrowness is related to two geometric conditions shown in |l8j to char- 



acterize virtually free groups. By [18, Theorem 2.9] any finitely generated 
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group G (accessible or not) is virtually free if and only if the components 
of the complements of all balls fall into finitely many isomorphism classes 
of labeled graphs. In fact by a remark in the proof of that theorem, G is 
virtually free if and only if the frontiers of the components have uniformly 
bounded size. Clearly this condition implies that G is narrow, as any ball 
B is separated from all but finitely many balls of the same radius by the 
frontiers of the components of the complement of B. Conversely if G is 
narrow and accessible, then by [^] and Theorem 1.4, G satisfies the other 
two geometric conditions. We do not know if there are narrow groups which 
are not virtually free. 

We are indebted to Swarup Gadde, Chuck Miller, Walter Neumann, 
Christophe Pittet, and Nick Wormald for helpful conversations and in par- 
ticular to Pittet for an argument which appeared in earlier versions of this 
paper. The second author wishes to extend special thanks to Rostislav 
Grigorchuk for his hospitality, and for first introducing him to this problem. 

The reader is referred to Aho's original paper 0] on nested stack au- 
tomata, and for background on language theory to Hopcroft and Ullman's 
book [|lS] on automata theory. The books by Cohen ||] and Lyndon and 
Schupp [16[ are references for combinatorial group theory. 



2. Preliminaries. 



Definition 2.1. A map (p : X —> X' between metric spaces (X,d) and 
(X',d') is a quasi-isometry if there exists a positive constant k such that 

1. (l/k)d(x, y) -k < d(tp(x), (p(y)) < kd(x, y) + k; 

2. X' = U 9eX B k (<p(x)) 

For example inclusion of a group with the word metric corresponding to 
a choice of generators into the Cayley determined by the same generators is 
a quasi-isometry. 

Quasi-isometry of metric spaces is an equivalence relation. As different 
choices of finite generating set for a group yield quasi-isometric metrics, it 
follows that all Cayley diagrams for a finitely generated group are quasi- 
isometric. In fact Cayley diagrams for commensurable groups are quasi- 
isometric H Prop. 11, page 8]. 

Recall that all groups are assumed to be finitely generated. A virtually 
free group is one with a free subgroup of finite index; virtually cyclic groups 
are defined likewise. All finite groups are virtually cyclic. 

A group G is one-ended if for all r the complement in its Cayley diagram 
of the ball of radius r around the identity has exactly one infinite component. 
It is not hard to show that the validity of this condition is independent of 
the generating set of the group. A famous theorem of Stallings |2(| says 
that a group which is not one-ended is either virtually cyclic, or splits as 
a free product of two factors with a finite subgroup amalgamated, or is an 
HNN extension with one stable letter and finite associated subgroups. 
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In the latter two cases we have G = H\ *k H2 or G = (Hq, t \ t^ 1 K\t = 
K2) respectively for subgroups Hi of G. We call Hi a factor and Ki an asso- 
ciated subgroup. In either case we say that G splits over a finite subgroup. 

When a group G splits over a finite subgroup, it may be possible that a 
factor splits over a finite subgroup, and one of the factors of that splitting 
splits again etc. G is accessible if there is an upper bound on the length 
of any such chain of splittings. The least upper bound is the accessibility 
length of G. It is a result of Dunwoody [Q] that a finitely presented group 
is accessible. 

Lemma 2.2. An accessible group is either virtually free or contains a one- 
ended subgroup. 

Proof. If G does not split over a finite subgroup, then it is virtually cyclic 
or one-ended. If it does split, use induction on accessibility length together 



with results of Gregorac [11] and Karrass, Pietrowski and Solitar 14] which 
say that a group which splits over a finite subgroup is virtually free if each 
factor is. □ 



3. Machines. 

We begin with an informal account based on the original definition of nested 
stack automata as a certain kind of computer [jl]] . However this definition is 
unwieldy, so our subsequent formal definition is in terms of labeled graphs. 
This approach to automata theory is well established. See for example 
Brainerd and Landweber |^, Chapter 4], Eilenberg || Volume A, Chapter X], 
Floyd and Biegel @, Gilman ||, Goldstine flOtl , and Salomaa, Wood and 

Yu ra. 



xi 



X2 



a 


b 


c 







112 



X 3 




Figure 1. A nested stack automaton. 

Figure || shows a nested stack automaton A consisting of a finite one-way 
input tape holding a word form an input alphabet S, a finite set of internal 
states {qi}, and a memory which at each instant contains a finite directed 
tree with edge labels from a memory alphabet S. At each instant A is in a 
particular internal state, qi, and is either scanning a cell on the input tape 
or has moved off the tape to the right. Also A is pointing at a vertex of 
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its memory tree. The label of the inedge to this vertex is called the current 
memory symbol. If A is pointing to the root, the current memory symbol is 
the empty word e. 

A begins a computation in a designated initial state qo and scanning the 
leftmost cell on its input tape (or off the tape if the input is e). Initially 
the memory is empty; that is, the memory tree consists of just a single ver- 
tex. Each computation consists of a number of moves, and A is completely 
specified by a list of moves to be made for various combinations of internal 
state, input letter, and current memory symbol. The input letter is either 
the content of the current cell on the input tape or e. In the former case A 
moves right on the input tape as the last part of the move; in the latter case 
it does not. If A has moved off the input tape, only e-moves are possible. 
For any particular combination of internal state, input letter, and memory 
symbol there may be one, many or no moves specified. 

If there is a sequence of moves in which A moves off the input tape and 
reaches a configuration with empty memory and one of a designated set 
of final states, then A accepts the word on the input tape. The set of all 
accepted words is the language accepted by A. If A reaches a situation in 
which no move is possible, it halts. If the conditions just mentioned obtain, 
then the input is accepted, otherwise not. A need not halt after accepting 
an input, but its susequent behavior has no effect on the language accepted. 

In addition to updating the pointer to the input tape, the other parts 
of a move are the choice of a new internal state and a memory operation. 
There are four memory operations in addition to the trivial operation 1 in 
which the memory tree T is left unchanged. To define these operations we 
observe that during a computation vertices are added to and deleted from 
T. Thus at any given time the vertices are ordered according to when they 
were added to T. The root is the earliest vertex and is never deleted. The 
four memory operations are moving the pointer down to the the parent of 
the current vertex if that vertex is not the root, moving the pointer up to 
the latest child of a vertex if a child exists, deleting the current vertex if it is 
a leaf but not the root, and adding a new edge whose source is the current 
vertex and whose target is a new leaf. After deleting a leaf the current vertex 
is set to the source of the edge to that leaf, and after adding a new edge the 
current vertex is set to the new leaf. 

It follows from the definitions above that the the vertex pointed to by A is 
always on the path from the root to the latest vertex, which is a leaf. In par- 
ticular only the latest vertex can be deleted. Our description is not quite the 
same as the original in . There the memory tree of Figure [l] would be re- 
placed by a memory tape containing three nested stacks $xiX2$yi$^y2^3#- 
In addition we allow NSA's to operate on empty stack. We leave it to the 
reader to check that the two kinds of machine are equivalent in the sense 
that each can simulate the other. 

Now we will give a more precise definition of nested stack automaton. 
We fix once and for all an infinite countable memory alphabet 5. Each 
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NSA will use only finitely many letters from S, so this convention does no 
harm. We order the vertices of a finite tree by depth first search. That is, 
the root is earliest followed by all the vertices in order along a path to a 
leaf. Then we back up to the first vertex with an outedge which has not 
been traversed and continue along that edge to another leaf, etc. Edges are 
ordered according to their target vertices. If the tree in Figure |l| is ordered 
this way by taking the leftmost possible outedge at each opportunity, the 
corresponding ordering of edge labels would be X3, X2,xi,y2, yi, z. 

It is clear that for any tree ordered by depth-first seach the latest vertex 
is a leaf and deleting it gives a tree ordered in the same way. Likewise adding 
a new edge with source anywhere along the path from the root to the latest 
vertex and making the new leaf later than all the other vertices in the tree 
yields a tree ordered in the right way. 

Definition 3.1. The set T of memory trees consists of all finite trees T 
ordered by depth-first search with 

1. Root vertex vq] 

2. Edges labeled by letters from S; 

3. All edges directed away from the root; 

4. One distinguished vertex on the path from vq to the latest vertex of T. 
To is the tree consisting of just vq. 

Next we define a monoid of operators on memory trees. 

Definition 3.2. M nsa is the monoid generated under composition by cer- 
tain partial maps from T to itself. Pick T € T with distinguished vertex v, 
and let y be the label of the inedge to v. If v = vq, then y = e. We describe 
the effect of the partial maps on T. In each case if T does not satisfy the 
conditions given, then the map is not defined at T. 

D X (T), x € S: If x = y and v ^ vq, then D X (T) is obtained by changing 

the distinguished vertex of T to the parent of v. 
U X (T), x € H U {e}: If x = y and v is not a leaf, make the latest child of 

v the new distinguished vertex. 
P X (T), iGS: Add to T a new edge with source v, label x, and target 

a new vertex v\ . Make v\ the latest vertex of T and the new current 

vertex. 

Q X (T), x € E: If x = y and v is a leaf with parent v\, delete v and its 
inedge. Make v\ the distinguished vertex. 

Clearly M nsa acts by partial injective maps whence it is a submonoid of the 
symmetric inverse semigroup on T. M nsa has both an identity and a zero 
element, which we denote by 1 and respectively. 

Definition 3.3. Let £ be a finite alphabet. A nested stack automaton A 
over E is a finite directed graph with a designated initial vertex, designated 
final vertices, and with edges labeled by pairs (m, a) where 06SU {e}, and 
either m = 1 or m is one of the generators defined in Definition |3.2| . 
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Every (directed) path in A has a label (m, w) formed by multiplying the 
components of the edge labels in order. A path of length zero has label 

(M). 

Definition 3.4. A computation of a nested stack automaton A is a path 7 
which starts at the initial vertex of A and has label (m, w) for some m with 
T = m(To) defined. T is called the outcome of 7. The word w is accepted by 
A if there is a computation with label (w, m) and outcome T = Tq ending 
at a final state. These computations are called successful. The set of all 
accepted words is the language accepted by A. 

In other words A accepts w if it can read all of w, empty its memory, and 
stop at a final state. One can also consider NSA's which accept just by final 
state or empty memory, but we do not do so here. 



(P x ,a) (D x ,b) (U x ,c) 




Figure 2. An NSA A which accepts {a n b n c n d n }*. 

Figure | shows an NSA A which accepts {a n b n c n d n }* = {{a n b n c n d n ) k \ 
n > 0,k > 0}. Vertex 1 is both the start vertex and the single final vertex. 
To see that A accepts the language claimed, first observe that domain of 
U x is disjoint from the range of Q x . Thus a computation in the sense of 
Definition |3.4| cannot have an edge with label (Q x ,d) followed by one with 
label (U x ,c). Consequently the label of any successful computation by A is 
a product of terms of the form (P y P^,DiU y U x Q x Qy,a l b : 'c k d l ). In particular 
there is a successful computation whose label is the empty product, (l,e), 
so A accepts e. 

To show that the language accepted by A is as claimed, it suffices to show 
that the exponents i,j,k,l in any term above are equal and that all cases 
in which the exponents are equal and greater than can occur. Suppose 
(PyP x DiU y U x Q x Qy, a 1 ^ c k d l ) is the first term. A begins by constructing 
a memory tree T with a single branch labeled yx l . If i < j, then A tries 
unsuccessfully to move down to the root of T by executing D x while pointing 
at the vertex of T with inedge labeled y. If i > j, then A tries to move up 
past the leaf of T. Conseqently A reaches vertex 4 if and only if i = j > 
and k > 0. Additional arguments of a similar nature demonstrate that A 
returns to vertex 1 if and only iii=j = k = l>l and that upon its 
return T = Tq. But T = Tq imples that our analysis applies to each term in 
succession. 

It follows easily from the preceding paragraph that at any point in a 
computation the memory tree T has only a single branch. In other words A 
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is a stack automaton; operations P x are executed only when the distinguished 
vertex is a leaf. In addition it is clear from Figure [2] that A has at most one 
move for each combination of state and input symbol. Such automata are 
called deterministic. 

Definition 3.5. An NSA A over E is deterministic if each combination of 
vertex v, memory T, and letter a £ E admits at most one outedge with label 
(m, a) or (to, e) such that to(T) is defined. In other words the to's which 
occur in labels of these outedges have pairwise disjoint domains in T. 

Any computation of length n by a deterministic NSA can be continued in 
at most one way to a computation of length n + 1. 

Definition 3.6. An NSA A has limited erasing if there is a constant k 
such that every path in A with label (to, e) has at most k edges with labels 
involving Q. In other words the A makes at most k erasures from its memory 
without consuming input. 

The NSA in Figure [2] is deterministic and has limited erasing with k = 1. 

Lemma 3.7. Suppose L C E* is the language of all words accepted by the 
deterministic NSA with limited erasing A over E. If A is a finite alphabet 
and f : A* —* E* is a homomorphism which does not map any generator to 
the empy word, then / _1 (L) is also accepted by a deterministic NSA with 
limited erasing. 

Proof. Suppose first that / maps generators to generators. In this case we 
can construct the required NSA A' directly from A. Replace each edge with 
label (to, a), a € E, by a set of edges with labels (m,p) for all p € / _1 (a). 
The new edges have the same source and target as the edge they replace. 
If / _1 (a) is empty, the original ledge is simply deleted. It is clear from the 
construction that A' is deterministic with limited erasing. Further for any 
path in A from vertex v\ to v% with label (m, w) there is for each w' £ 
a path in A' from v\ to V2 label (m,p). Conversely every path in A' with 
label (m,w') projects to a path in A with label (m, f(w')). It follows that 
A' accepts / _1 (L). 

In general / factors as a product of homomorphisms of the type just 
considered and homomorphisms which map one generator into a word of 
length two and all other generators into themselves. Thus it suffices to 
prove / _1 (L) is an NSA language in this second case. More precisely we 
may assume that A and E are the same except that one generator a € A 
is replaced by a\ and a% in E. Further /(a) = a\a2 and / maps all other 
elements of A to themselves. 

Construct an NSA A 1 from the union of two disjoint copies of A, say A\ 
and A2- The idea here is to modify A\ U A2 so that paths in A correspond 
to paths in A\ except that subpaths with label sequences 



(mi,ai), (m 2 ,e), . . . , (m fc _i,e), (mjfc,a 2 ) 
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move over to A2 at the first edge and return to A\ at the last edge. Further 
the first label of such a subpath is changed to (mi, a) and the last to (m^, e). 
It is also necessary to prevent A' from emptying its memory at any vertex 
in Ai. For this purpose we add a new memory symbol z which is pushed on 
to the memory at the beginning of each computation and which can only be 
removed at final vertices in A\. 

More precisely A' is formed from the disjoint union A\ U A2 U {vq, v\} in 
the following way. 

1. The start vertex of A' is v$, and there is an edge from vq to the start 
vertex of A\ with label (P z , e); 

2. Every final vertex of A\ has an outedge to v\ with label (Q 2 , e); 

3. The single final vertex is v\. 

4. Each edge in A\ with a\ in its label has a\ changed to a and its terminal 
vertex changed to the corresponding vertex in N2; 

5. All edges from A2 except those with an e or ai in their label are re- 
moved; 

6. Each edge in A2 whose label involves a<i has ai changed to e and its 
terminal vertex changed to the corresponding vertex of N\. 

It is straightforward to check that A' is deterministic with limited erasing, 
and accepts f~ l {L). □ 



4. Group languages. 

In this section we develop the first properties of groups whose word problems 
are solvable by deterministic nested stack automata with limited erasing. 
From now on NSA will refer to an anutomaton of this type which accepts 
by final state and empty stack, and NSA language will mean a language 
accepted by an NSA. 

We fix some notation. A choice of generators for a group G is a surjective 
monoid homomorphism a : S* — > G from a finitely generated free monoid. 
We will write W for a(w) and assume that a choice of generators a : X* — > G 
always has formal inverses. That is, S is a union of pairs {a, a -1 } and 
o"(a _1 ) = (cr(a)) -1 . We emphasize that £ still generates £* freely as a 
monoid; there is no cancellation in E*. Recall that e denotes the empty 
word and e = 1. The word problem of G corresponding to a certain choice 
of generators is{wGS*|lZJ=l}. 

Lemma 4.1. If the word problem of a group G with respect to one choice of 
generators is an NSA language, then so are the word problem with respect to 
any choice of generators and the word problem for every finitely generated 
subgroup of G. 

Proof. Let a : S* — > G be a generating set for G such that an NSA solves the 
word problem for G with respect to £*, and 5 : A* — > H C G be a generating 
set for H. Choose a homomorphism /:£*—> A* such that 8 o / = a and / 



does not map any generator to the identity. Apply Lemma 3.7. □ 
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Now let A be an NSA over S accepting the word problem of G with 
respect to a choice of generators S* — > G. We use A to construct a graph 
A which covers both A and Q, the Cayley diagram of G. We augment Q by 
adding an edge with label e from every vertex to itself. 

Definition 4.2. A configuration of an NSA A is a pair (q,T) where q is a 
state of A and T is a memory tree. A configuration is accessible if T is the 
outcome of a valid computation 7 ending at q and if there is a continuation 
7' such that 77' is successful. 

Accessible configurations might more properly be called accessible and 
co-accessible. 

Definition 4.3. The configuration graph A of A has as vertices all accessible 
configurations. There is an edge from (q,T) to (q',T') with label a € SU{e} 
if and only if there is an edge with label (m, a) from q to q' in A such that 
m(T) = T' . The initial vertex of A is (qo,To) where go is the initial vertex 
of A. 




Figure 3. Part of the configuration graph of the NSA A of Figure 



In Figure ^ a vertex with label yx3x, say, stands for the configuration 
(3, T) in which T consists of one branch of length three with label yxx and 
distinguished vertex a distance two from the root. 



The the first half of the next lemma is clear from Definition 3.3 and 



Definition 3.4; the second half follows from Definitions 3.5 and 3.6 



Lemma 4.4. The following conditions hold. 

1. Every computation 7 of A which can be continued to a successful com- 
putation lifts uniquely to a path in A which starts at {qo,To) and has 
label equal to the first component of the label of '7. 
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2. // a computation which ends at q with outcome T lifts to a path starting 
at (qQ,TQ), then the lift ends at (g,T). Conversely any path in A from 
(go. To) to a vertex (q,T) is the lift of a computation with outcome T. 

3. Each vertex of A either has a single outedge with label e and no other 
outedges, or it has no outedges with label e and at most one outedge 
with label a for each a € E. 

4. For some constant K any path in A has at most K successive edges 
labeled e. 



Lemma 4.5. There is a homomorphism of labeled graphs ip : .A — > Q , where 
Q is Cayley diagram of G augmented by the addition of a loop with label e at 
every vertex. The image of a vertex in A is the group element represented 
by the label of any path from (go, To) to that vertex. In particular the initial 
vertex of A maps to 1. Every path Q starting at 1 lifts to a path in A which 
starts at (go, To) an d projects to a path differing from the original only by 
addition or deletion of edges with label e. The lift is unique up to a terminal 
segment with label e. 

Proof. By definition of A each vertex (g, T) is reached by a path from 
(go, To). Suppose there are two such paths with labels w and w' . By the 
definition of A again there is a path from (q, T) to some vertex (g', To). Let 
u be the label of this path. Then A accepts wu and w'u whence both denote 
the identity in G. It follows that w and w' represent the same element of G. 

Define <p by mapping each vertex of A to the group element represented by 
the label of any path from (go, To) to that vertex. In particular ip((qo, To)) = 
1. Suppose there is an edge from (q,T) to (q',T f ) with label a G E U { e}. 
Pick a path from (go, To) to (q, T), and let its label be w. As wa is the label 
of a path to (g',T'), it follows that if (p((q,T)) = g, then tp((q',T')) = ga. 
Thus (f is a graph homomorphism. 

Since <p is a homomorphism, the penultmate assertion amounts to showing 
that for every w € E* there is a path in A starting at (go, To) and with label 
w. But for some bGS*, wv represents the identity in G and hence there is 
a successful computation in A with label (m,w). The last assertion follows 



from the third part of Lemma 4.4. □ 



We pause to remark on a difference in configuration graphs of NSA's and 
pushdown automata which illustrates the different power of the two types 
of automaton. A pushdown automaton is a nested stack automaton, not 
necessarily quasi-realtime or deterministic, which has no labels involving 
D x or U x . In other words its memory trees all have just one branch and 
the distinguished vertex is always the leaf. We metrize graphs in the usual 
way by disregarding orientation and taking edges to be isometric to the unit 
interval. 



Theorem 4.6. The configuration graph of a pushdown automaton is quasi- 
isometric to a tree. This quasi-isometry is given by a quotient map. In 
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contrast to this, the configuration graph of a stack automaton ( and hence an 
NSA) may have arbitrarily large isometrically embedded loops. 

Proof. The second claim is clear from Figure ||. It remains to prove that 
the configuration graph of a pushdown automaton is quasi-isometric to a 
tree. Let A be the configuration graph of a pushdown automaton A. We 
will write (p, T) ~ (q, T) if there is an undirected path in A between (p, T) 
and (q, T) with the property that T is an initial segment of the memory tree 
of every intermediate vertex. That is to say, (p, T) ~ (q, T) if we can get 
from (p, T) to (q, T) by a sequence of forward and backwards moves without 
ever erasing any portion of T. 

Since A has no labels involving D x , the existence of such a path depends 
only on p, q, and the label of the inedge to the leaf of T. It follows that 
there is a universal bound on the undirected distance between (p, T) to 
(q,T) whenever (p, T) ~ (q,T). We denote the equivalence class of (p, T) 
by ([p],T) since T is constant throughout the class. This is an abuse of 
notation, since [p] depends on T. We console ourselves with the hope that 
T will be clear from context. 

Project A to the graph A whose vertices are equivalence classes of ~ with 
distinct classes joined by an unoriented edge if there is an edge between any 
two vertices in the preimages of the classes. In the case where an edge of 
A connects a vertex to itself, we project that edge of A to the image of its 
endpoints in A. We have seen that there is a universal bound on the size of 
( [p] , T) . It follows that the quotient map is a quasi-isometry. We will show 
that A has no simple cycles and is therefore a tree. 

Suppose to the contrary that there is a simple cycle in A. Since A has no 
loops and at most one edge between any two vertices, this cycle must have 
length at least three. We examine the cycle at a place where T has maximal 
size. Either there is a single edge between distinct vertices ([p],T) and 
([q],T), or there are two edges connecting distinct vertices ([p],T'), ([q],T') 
with T" a proper prefix of T to a vertex ([r],T"). The vertices ([p],T") and 
([q],T') have the same memory tree because their memory trees are shorter 
than T but derived from T by popping a single element. 

In the first case, there is an edge in A between (p',T) and (q',T) for 
some p' € [p] and q' € [q] . Consequently ( \p] , T) = ([q] , T) , and we have a 
contradiction. In the second case there is an undirected path from (p',T') 
to (g', T') along which each memory tree is an initial segment of T. Hence 
([p],T') = ([<?], T'), and our cycle is not simple. It follows that A is a 
tree. □ 

Notice that the proof above does not work for the usual kind of pushdown 
automata which allowed to push a string of letters onto the stack on one 
move. Our pushdown automata are restricted to pushing one symbol at 
a time. However is easy to simulate a more general automaton by adding 
internal states to obtain one of ours, and it is straightforward to show that 
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the configuration graphs of the two machines are quasi-isometric. Thus the 
result above holds for standard pushdown automata. 



Theorem 4.6 provides another way of proving the result of Muller and 
Schupp. For we now have the configuration graph A mapping to the Cayley 
graph Q and quasi-isometrically to a tree T. The lifting properties quickly 
show that Q is not one-ended and one proceeds as before. This points 
up a difference between our result and that of Muller and Schupp. In the 
case of pushdown automata, the treeness — and therefore, the freeness - 
are already implicit in the class of machines. In the case of nested stack 
automata, the freeness is a result of the class of automata together with the 
fact that they are being used to solve the word problem. 

5. Proof of Theorem [l 

Suppose that ip : G — > G' is a quasi-isometry of groups and that G' is 
narrow. It suffices to show that G is narrow too. Pick a ball B C G of 



radius r and let B be its image in G' . By Definition 2.1 B lies in a ball C 
of radius k(r + 1). Since G' is narrow, there is an integer i such that all but 
finitely many balls CL C G' of radius k(r + 1) are separated from C by i 
points. 



It follows from Definition 2.1 that (p is uniformly finite to one. Conse- 
quently the preimage in G of the union of the balls Cj not satisfying the 
separation condition is finite. Thus with a finite number of exceptions every 
ball B' C G of radius r has an image which lies in a ball C separated from 
C by a set S of size i. 

The image under ip of any path from B to B' is a sequence of points from 
C to C with each point a distance at most 2k from its successor. Add points 
to obtain a path from C to C and observe that this path must intersect 
S. It follows that at least one of the original image points is a distance at 
most k from S. In other words the preimage in G of all points within k of 
S disconnects B and B'. Since ip is uniformly finite to one, we are done. 



6. Proof of Theorem |1. 

We must prove the following three implications: 

1. A group which is wide is not virtually free. 

2. A group which is not virtually free has a one-ended subgroup. 

3. A group with a one-ended subgroup is wide. 

The first of these follows from the discussion immediately following the 



statement of the Theorem, and the second is a consequence of Lemma 2.2 . 
Thus it remains only to prove that if G contains a one-ended subgroup, then 
it is wide. Choose generators for G which contain generators of the subgroup; 
then it follows immediately from Definition [L^ that if G is narrow, so is the 
subgroup. Conversely if the subgroup is wide, so is G. Thus it suffices to 
prove that a one-ended group is wide. 
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Suppose to the contrary that G is one-ended and narrow. Assume we 
can choose the sets S in Definition [L^ so that the distance between any 
two points in S is uniformly bounded. It follows from the fact that G acts 
transitively and isometrically on itself by left translation that up to the 
action of G there are only finitely many different S's. Thus at least one 
S separates pairs of balls of arbitrarily large radius whence G — S has at 
least two infinite connected components contrary to the hypothesis that G 
is one-ended. 

We complete the proof by showing that we can choose the sets S as 
required. Observe that since G is one-ended and of bounded valence, G— {1} 
has one infinite component and finitely many finite ones. Some ball C of 
radius at least 1 around 1 contains all the finite components, and any two 
vertices on the boundary of C are joined by a path lying entirely in the 
infinite component. Pick one such path for each pair of vertices on the 
boundary of C, and pick a ball D around 1 containing C and all these 
paths. Let D have radius d. It is clear from Definition L2 that we may 
assume the balls to be separated are of radius greater than d. 

It does no harm to restrict attention to sets S which are minimal with 
respect to inclusion, and we do so. Pick balls B and B' of radius greater 
than d and separated by a set S. Suppose S contains a point v at a distance 
greater than d from all other points of S. Without loss of generality we 
assume this vertex is 1 whence Dfl5 = {l}. 

By minimality of S there is a path 7 from from B to B' with "fdS = {1}. 
If 7 starts outside C, define 71 to be the initial segment of 7 from B to a 
point x on the boundary of C. Otherwise BnC is not empty; and because B 
is connected and larger than C, there is a point x in B and on the boundary 
of C. In this case define 71 to be the path of length zero from x to itself. In 
both cases 71 goes from B to a point x on the boundary of C and intersects 
S trivially. Define 73 similarly from a point y on the boundary of C to B', 
and let 72 be a path from x to y lying in D — {1}. Clearly 717273 is a 
path from B to B' intersecting S trivially. Hence S cannot have a point v 
a distance greater than d from all other points of S. Since the sets S have 
uniformly bounded size, we are done. 



7. Proof of Theorem [O] 

Suppose that the word problem of G is accepted by a deterministic nested 
stack automaton with limited erasing A with q states and limited erasing 



constant K. To prove Theorem LI it is enough to show that G is narrow. 
Recall that the configuration graph A of A projects onto the Cayley diagram 
Q of G where Q is augmented by the addition of edge with label e from every 
vertex to itself. For any constant C there are only finitely many vertices in 
A with memory tree of at most C edges. Thus for any element g G G far 
enough away from 1 every vertex A projecting to g has memory tree with 
at least C edges. 
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By the action of G we may take B to be a ball around 1; and by discarding 
only finitely many possiblities for B' we may assume that the center of B' 
is an element g far enough away from 1 so that every vertex in the preimage 
in A of g has a memory tree with at least 2Kr + 1 edges. If necessary move 
g farther away so that B and B' are a distance at least 2 from each other. 

Fix a path 7 from 1 to g in Q. By Lemma [4.5| there is a unique shortest 
path 7 beginning at the initial vertex (go, Tq) of A and projecting to 7. By 
choice of g, 7 ends at (g, T) for some memory tree with at least 2Kr + 3 
edges. Recall that the edges of T are ordered, and let X" be the tree obtained 
by removing the Kr + 1 latest edges and taking the latest remaining vertex 
as distinguished. We claim that the images in G of all vertices of the form 
(q',T') separate B and B' . 

It suffices to show that any path beginning at the boundary of B' and 
ending at the boundary of B contains one of the desired vertices. Extend 
such a path by geodesies of length r to a path 71 from g to 1 in G, and 
consider the cycle 771. As the label w of this cycle represents the identity 
in G, there must be a path in A from (go, Tq) to a vertex (gi, To) with label 
w and gi a final state. Our choice of 7 insures that 7 is a prefix of this path. 
Consequently there is a path 71 from (g, T) to (gi,To) projecting to 71. 

Since A is limited erasing, the initial and terminal segments of 71 project- 
ing to the geodesic segments at the ends of 7 have length at most Kr. It 
follows at most Kr + 1 edges of T can be deleted along each of these initial 
and terminal segments. Since edges must be deleted in order, it follows that 
there is an edge not in these segments with label (q',T'). 
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